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Defining regulatory elements governing cell-restricted gene 
expression can be difficult because cis-elements may reside 
tens of kilobases away from start site(s) of transcription. 
Artificial chromosomes, which harbor hundreds of kilobases 
of genomic DNA, preserve a large sequence landscape con- 
taining most, if not all, regulatory elements controlling the 
expression of a particular gene. Here, we report on the use of 
a bacterial artificial chromosome (BAC) to begin understand- 
ing the in vivo regulation of smooth muscle calponin (SM- 
Calp). Long and accurate polymerase chain reaction, se- 
quencing, and in silico analyses facilitated the complete 
sequence annotation of a BAC harboring human SM-Calp 
(hSM-Calp). RNase protection, in situ hybridization, West- 
ern blotting, and immunohistochemistry assays showed the 
BAC clone faithfully expressed hSM-Calp in both cultured 
cells and transgenic mice. Moreover, expression of hSM-Calp 
mirrored that of endogenous mouse SM-Calp suggesting that 
all cis-regulatory elements governing hSM-Calp expression 
in vivo were contained within the BAC. These BAC mice 
represent a new model system in which to systematically 
assess regulatory elements governing SM-Calp transcription 
in vivo, 

promoter; development; genome 



cellular differentiation involves a carefully orches- 
trated program of gene transcription resulting in a cell 
type-specific transcriptome. The completion of the se- 
quencing phase of the Human Genome Project has 
provided important raw data for the mining of regula- 
tory elements governing the expression profile of a 
gene (54). This is of particular importance for those 
genes whose expression is altered in disease states. 



Vascular occlusive diseases, for example, are often 
characterized by the altered expression of a battery of 
smooth muscle cell (SMO-restricted genes (40). Under- 
standing how the normal SMC differentiation program 
is governed represents an important first step toward 
elucidating tie pathways and factors leading to atten- 
uated gene expression in SMC-associated diseases. Ac- 
cordingly, several groups have characterized the ex- 
pression and regulation of SMC-restricted genes in a 
variety of transgenic model systems (8, 11, 14, 15, 
21-23, 25, 26, 28, 34, 44, 46, 55). Whereas the majority 
of these experiments have yielded expected promoter 
activities based on the endogenous gene's pattern of 
expression, some of the findings have been unexpected. 
For example, the endogenous SM22 gene is expressed 
in all three muscle lineages during development and 
then becomes restricted to adult vascular and visceral 
SMC in postnatal life (20). Transgenic mouse studies 
(15, 21, 34), however, have consistently demonstrated 
the virtual exclusive activity of the SM22 promoter in 
arterial SMC of adult animals. In another example, a 
genomic fragment encompassing the smooth muscle 
myosin heavy chain promoter and part of the first 
intron failed to direct expression of a reporter gene in 
vascular SMC of the lung, head, and neck (26). To- 
gether, these reports support the concept of multiple, 
nonoverlapping regulatory elements in and around 
gene loci that are necessary for the complete expres- 
sion profile of a gene. 

An important outgrowth of the Human Genome 
Project that has assisted investigators in defining the 
boundaries of genomic DNA necessary for the complete 
expression of a particular gene has been the develop- 
ment of artificial chromosomes (4, 50). Several studies 
have already exploited these large^capacity cloning 
vectors in the generation of transgenic animals to char- 
acterize DNA sequences necessary for a gene's appro- 
priate pattern of expression. Initial studies using yeast 
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artificial chromosomes showed the appropriate pattern 
of expression for the human p-globin and human ty- 
rosinase genes (42, 49). Subsequent reports using ei- 
ther yeast artificial chromosomes, bacterial artificial 
chromosomes (BACs), or PI phage artificial chromo- 
somes revealed correct spatiotemporal expression _of 
the skeletal muscle regulatory gene myf-5 (57), tne 
existence of an intestinal-specific enhancer located 
55-kb upstream of the human apoiipoprotem B gene 
(36 37), expression of human desmin in all three mus- 
cle 'lineages (45), a repressor of insulin-like growth 
factor-2 transcription located 40-kb downstream of tne 
gene (1), and juxtaglomerular-res trie ted expression of 
the human renin gene (51). Whereas these data dem- 
onstrate the power of using artificial chromosomes to 
capture distal regulatory elements, there have been 
instances where such an approach has failed to com- 
pletely recapitulate an endogenous gene's expression 

Smooth muscle calponin (SM-Calp) is a tightly re- 
stricted gene expressed transiently in the embryonic 
heart, before becoming restricted to all SMC-contain- 
ing tissues (31, 48). In vitro transient transection 
assays have shown that SM-Calp promoter constructs 
display promiscuous activity when introduced into 
non-SMC types (16, 31). On the other hand repeated 
attempts to recapitulate the endogenous SM-Calp s 
pattern of expression in transgenic mice using conven- 
tional lacZ reporter constructs have shown either no 
expression whatsoever or ectopic expression due to 
position effects (unpublished observations, J- M. Mi- 
ano) To circumvent these difficulties, we have adopted 
a strategy of using a BAC harboring the entire human 
SM (hSMVCalp locus to assess expression of SM-Calp 
in an in vitro model system of cellular differentiation 
as well as developing and postnatal transgenic mice. In 
this report, we describe the initial characterization of 
this BAC as well as its appropriately directed expres- 
sion of the hSM-Calp transgene. These studies consti- 
tute an important foundation for the discovery of all 
regulatory elements controlling SM-Calp gene expres- 
sion in vivo. 

MATERIALS AND METHODS 

BAC screening and characterization. A 656-bp polymerase 
chain reaction (PCR) fragment to the hSM-Calp cDNA (for- 
ward primer, GCATGGAGCACTGCGACA and reverse 
primer, CACTGTCACATCCACATAGTATG) was used to 
screen a X4 to X5 high-density arrayed human genomic BAO 
library according to a commercial protocol (Research Genet- 
ics- Huntsville, AL). Two BACs were obtained and subse- 
quently analyzed with PCR primer pairs to the 6' and 3' ends 
of the hSM-Calp transcription unit. One of the BACs (BC- 
100899) contained the complete hSM-Calp locus and was 
subsequently used for all studies. BAC DNA was trans- 
formed into competent bacteria (DHlOBr) and grown in Lu- 
ria-Bertani Medium containing 12.5 ug/ml chloramphenicol. 
DNA purification was by alkaline lyBis, The size of BC- 
100899 was estimated by restriction digestion and pulse-field 
gel electrophoresis (data not shown). The ends of the BAC 
clone were sequenced and a large-scale £coRI restriction map 
generated (see clone BC-100899 at http://bbjp.lml.gov/rmap). 



Long and accurate PCR (TaKaRa, Panvera; Madison. WI) 
waTused to ascertain the position of the SM-Calp locus 
relative to the BAC ends. In addition, we used both long and 
accurate PCR as well as the NCBI workstation to define the 
intron-exon boundaries of each transcription unit within 
BC-100899 (Bee Fig. IB). 

Generation of stable ceil lines. The BC 3 H1 and L6 myoblast 
cell lines (American Type Culture Collection; Manassas, VAJ 
were used to stably integrate BC400899 and assess its 
directed expression of the hSM-Calp transgene in vitro. We 
chose the BC3HI cell line as a model system for studying the 
BAC for two reasons. First, BC3HI are the only known cells 
that reversibly express SM-Calp, making them particularly 
valuable for studying the potential dynamic expression of a 
transgene such as SM-Calp (33), Such regulated expression, 
if conserved in hSM-Calp, would suggest evolutionary con- 
served cw-elements and fra/is-actmg factors whose identifi- 
cation and characterization could be easily earned out in 
such a cell line. Second, BC3HI cells are of mouse origin (31), 
thus facilitating species homogeneity in the evaluation of the 
BAC clone in cell culture and the ^ouse. 

Purified BAC DNA (-100 ug) was mixed with 1 ug of a 
neomyemncontaining plaamid (pCI-Neo, Promega; r Madison, 
WI) using Upofectamine (without Plus Reagent). DNA com- 
plexes were applied to cells in serum-free Dulbecco's modified 
Eagle's medium without antibiotics for 12-16 h. Complexes 
were then aspirated and the cells incubated in serum-con- 
taining medium for 24 h. Cells were then trypsmized and 
replated at varying dilutions (1:20-1:400) in complete me- 
dium containing either 600 ug/ml (BC3HI) or 900 u-g/ml (L6) 
G418 Resistant clones were expanded for PCR genotyping 
using the primers to hSM-Calp listed above as weU^inter- 
nal control primers to the ZP3 gene (forward, ACGCTCTA- 
CATCACCTGCCA, reverse, CACTGGGAAGAGACACTCAG). 
At least three independent clones per cell line were analyzed 
for expression of the hSM-Calp transgene by RT-PCR and 
RNase protection assays (see Generation of transgenic mice). 
A deletion construct was similarly tested by cutting the 
full-length BAC with Notl (see Fig. IB), by religating the 
fragment containing the hSM-Calp transcription unit into 
the pBeloBAC vector, and by electroporating DHlOBr com- 
petent bacteria. 

Generation of transgenic mice. Linearized B&1W899DNA 
was microinjected into the pronucleus of strain FVB-fertil- 
ized oocytes at the Medical College of Wisconsin and Colum- 
bia Transgenic Core Facilities. Tail snips from potential 
founders were obtained and digested at 53°C in lysis buffer 
(10 mM Tris-HCl, pH 8.0; 25 mM EDTA, pH 8.0; 100 mM 
NaCl, 1% sodium dodecyl sulfate, and 200 ug/ml proteinase 
K) for genomic DNA isolation. Initial genotyping was done by 
PCR using the hSM-Calp and ZP3 primers described above. 
Southern blotting and DNA hybridizations were also per- 
formed to estimate BAC copy number and structural integ- 
rity of the BAC clone (data not shown). PCR analysis of the 
BAC clone invariably revealed loss of one or both ends of the 
BAC, which had no bearing on the expression of the BAC 
transgenes. Both transgenic lines reported here have been 
maintained stably on the FVB background for four genera- 
tions. 

RNase protection assays. Transgene expression analysis 
required the isolation of total RNA from cells and animal 
tissues using the acid-phenol guanidinium isothiocyanate 
procedure (5). As a first estimate of hSM-Calp expression, we 
performed RT-PCR using human-specific primers flanking 
two introns of the hSM-Calp cDNA (forward, CGCCAACT- 
TCATCAACGCCAT and reverse, CACTGTCACATCCACAT- 
AGTATG). In addition, we analyzed the expression of the 
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Kg. 1. Physical map of BC- 100899 cont aining 
the human smooth muacle-calponin (hSM-Calp) 
locus. A: 2,664,311 bp contig to chromosome 
19pl3.2 encompassing BC-100899 (bold under- 
lined region) waa used to generate the indicated 
map. Note the gene-rich regions separated by 
so-called "deserts" of gene-poor, repetitive se- 
quences (denoted as gaps in the linear map). The 
accession number of this contig is NT-011176. B ; 
restriction map of BC-100899 revealing the com- 
plete gene structures of LOC11S950, evolution- 
arily conserved signaling intermediate in Toll 
pathways (ECS1TX hSM-Calp and MGC4549 as 
well as a partial gene structure to HuR. Less 
than 10 kb of intergenic sequence separates 
hSM-Calp from ECSIT. In silico analysis of this 
bacterial artificial chromosome (BAC) revealed 
no rearrangements or alterations in nucleotide 
sequence of the indicated transcription units. 
The centrally located Notl site (N) waa used to 
generate a deletion BAC clone for stable integra- 
tion of cells (see Fig. 3). Solid boxes represent 
numbered exons. The direction of transcription 
is indicated by the arrow under or over the gene 
name. 



MCC4549 



endogenous mouse SM-Calp gene (forward, ATGTCTTCTG- 
CACATTTTAAC, reverse, TCAATCCACTCTCTCAGCTCC), 
human ECSIT (forward, CTGGAGCAGATGGAGAACCAC 
and reverse, TAGATGGTGACCCTGGCACT), mouse ECSIT 
(forward, CTGGAGCAGATGGAACGACAC and reverse, CCA- 
TGCCTGTCGAGTCACTGG), human MGC4649 (forward, 
GCTCATCCACCTGCAGACATG and reverse, CACGGTA- 
CAAGAGATGACTCCG), and human LOC115950 (forward, 
ATGGCAGCAGGCAGCGGTG and reverse, GCACCGTTGT- 
TGCCAGCTC). As a control, we also examined the expres- 
sion of glyceraldehyde phosphate dehydrogenase (forward, 
GCCAAAAGGGTCATCATCTC and reverse, GGCCATCCA- 
CAGTGTTGT). Total RNA (~3 ag) was reverse transcribed 
at 37°C for 1 h with a First Strand cDNA Synthesis Kit 
(Amersham Pharmacia Biotech), as described in the manu- 
facturer's protocol. After reverse transcription, the cDNA 
templates were individually amplified with the above strand- 
specific primers. Products were resolved in a 1% agarose gel 
and visualized by ethidium bromide staining. 

To assess the adult tissue distribution of hSM-Calp more 
rigorously, we performed RNase protection assays with a 319 
NT human-specific riboprobe (32) labeled with f^PlUTP as 
per the manufacturer's instructions (Amnion; Austin, TX). 
Samples of tissue RNA from two independent transgenic 



mouse lines were hybridized with labeled riboprobe, RNase 
A/T digested, and then resolved In a 6% denaturing poly- 
acrylamide gel. Signal intensities were detected by autora- 
diography (X-OMAT film, Eastman Kodak; Rochester, NY). 

Western blotting. Cell monolayers were washed twice with 
cold phosphate-buffered saline and then scraped in extrac- 
tion buffer containing 55 mM Tris*HCl (pH 6.8), 10% glyc- 
erol, 2% sodium dodecyl sulfate. 10 mM dithiothreitol, 0.5 
mM EDTA, 100 jtg/ml phenylmethylsulfonyl fluoride, and 1 
jig/ml each of pepstatin A, leupeptin, and aprotinin. Alterna- 
tively, snap-frozen tissues from transgenic or nontransgenic 
mice were homogenized in the same buffer and spun at 
10,000 g for 20 min for supernatant isolation. Samples of cell 
or tissue lysates were sheared through a 23-gauge needle and 
then boiled for 5 min. Protein concentration was determined 
by the bicinchoninic acid assay (Pierce). Initially, samples of 
protein extracts (25-50 pg) were resolved through 10% poly- 
acrylamide gel and stained with Coomassie blue to confirm 
sample integrity and equal loading. Parallel samples were 
then electroblotted to nitrocellulose (Bio-Rad) and processed 
for Western blotting as described (31). The SM-Calp antibody 
(clone hCP, Sigma) was used at a dilution of 1:2,500. As a 
control, we incubated blots with an antibody to the SH2- 
adapter protein, Grb2 (c-7, Santa Cruz), or ^-tubulin (clone 
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5H1 FharMingen). Specific immunoreactive proteins were 
revealed on X-ray film with enhanced chemUumineacence 
reagents (Ameraham Pharmacia Biotech; Piscataway, NJ). 

Immunohistochemistry. To assess spatial expression of the 
hSM-Calp protein, sections of stomach, uterus, and liver 
were generated from transgenic and nontransgemc mice for 
immunohistochemistry using a mouse an ti-hSM-Calp mono- 
clonal antibody (DAKO; Carpinteria, CA). Antigen retrieval 
was achieved by microwave heating tissue sections in a 
solution of 0.1 M citrate buffer (pH 6.3, Zymed; San Fran- 
cisco, CA) for 10 min. Sections were preblocked for 30 rnin m 
normal horse serum, washed and incubated overnight at 4 ^ 
with the primary SM-Calp antibody (1:100 dilution). Avidm- 
Biotin Complex reagents (Vector Laboratories, Burlingame, 
CA) were then used to reveal hSM-Calp protein with the use 
of 3-amino-9-ethylcarbazole as substrate. 

In situ hybridization. Sections (6 Jim) of transgenic and 
nontransgenic adult tissues and developmental^ staged 
mouse embryos were used to define the spatial and temporal 
pattern of hSM-Calp mRNA expression. In prehminary stud- 
ies, we tested several hSM-Calp riboprobes for any evidence 
of cross-hybridization to the endogenous mouse SM-Calp 
transcript. The data shown in this report made use of a 3 
untranslated riboprobe (260 bp) that was generated with 
PCR primers to the hSM-Calp cDNA (forward, GCTA- 
CAGGGTCCAACATAGAAC, reverse, CACTGTCACATCCA- 
CATAGTATG). This riboprobe is 77% homologous to the 
corresponding mouse sequence but yielded some cross-hy- 
bridization to the endogenous mouse SM-Calp mRNA in 
developing embryos. Sense and anti-sense riboprobes were 
linearized and used in vitro transcription assays to generate 
^P-labeled riboprobes (MaxiScript, Ambion). Paraffin-em- 
bedded, paraformaldehyde-flxed tissue sections were depar- 
afflnized, rehydrated, and treated with 6 jig/ml proteinase K 
at 37°C for 6 min. Hybridization with -3 x 10 7 counts/mm of 
probe per milliliter of hybridization solution was performed 
overnight at 52°C in a humidified chamber. Slides were then 
washed to remove nonspecificaUy bound probe, treated .with 
20 oEtol RNase A at 37°C for 30 min, dehydrated, air-dned, 
and dipped in emulsion (Kodak NTB2). After 1 wk, slides 
were developed in Kodak D19 developer, fixed, and counter- 
stained with Mayer's hematoxylin. Darkfield images were 
generated with a Polaroid digital camera (DMC-1) using 
hnage<apture software (DMC version 2.0) and processed in 
Adobe Photoshop. 

RESULTS 

Mapping of transcription units in *£-*W^ The 
hSM-Calp locus maps to chromosome 19pl3.2 U / , oz). 
Figure 1A shows a schematic of a >2.6 Mb map encom- 
passing BC-100899 on chromosome 19pl3.2. Chromo- 
some 19 has the highest density of both genes (=-26 
genes per Mb DNA) and repetitive sequence in the 
human genome (13, 53). Both points are illustrated m 
this large contig with BC-100899 alone harboring five 
transcription units (Fig. 1A). A higher resolution map 
of the 103,424 nt BC-100899 is provided in Fig. IB. As 
reported previously (32), hSM-Calp comprises 7 exons 
spanning 11,473 nt. The hSM-Calp locus in BC-100899 
is flanked by 9,194 nt of 3' sequence and 82,879 nt of 
5' sequence. A divergently transcribed, EST-supported 
gene comprising 5 exons over 2,421 nt of DNA 
(MGC4649, accession number NM-032377) resides at 
the 3' end of BC-100899 immediately after hSM-Calp 
(Fig. IB). Three transcription units were discovered 5' 



of hSM-Calp and each of these initiates transcription 
on the opposite strand of DNA. Human evolutionary 
conserved signaling intermediate in Toll pathways 
(ECSIT) (17) is transcribed only 9,678 nt away from 
the start sites of hSM-Calp (Fig. IB). The ECSIT tran- 
scription unit comprises 8 exons spanning 23,220 nt. A 
recently predicted mRNA having a zinc fingerlike do- 
main (LOC115950, accession number XM-057078) and 
consisting of 9 exons over 22,390 nt practically over- 
laps with ECSIT. Finally, a portion of the HuR tran- 
scription unit (24) encompassing 5 exons spans nearly 
25 kb of the remaining 5* sequence of BC-100899 

^Regulated expression of hSM-Calp in BC3HI cells. 
Recently, we (33) reported downregulated expression 
of SM-Calp in BC3HI cells induced to differentiate into 
a skeletal muscle-like lineage, a finding consistent 
with the absence of SM-Calp expression in developing 
somites and mature skeletal- muscle. To determine 
whether hSM-Calp displays a similar regulatory pat- 
tern of expression, we generated BC3HI cell lines 
transfected stably with the BC-100899 clone. Such 
cells express hSM-Calp mRNA in a growing state (Fig. 
2A). In a manner identical to the endogenous mouse 
SM-Calp gene (33), hSM-Calp transcripts decline on 
cell difFerentiation and the adoption of a skeletal mus- 
cle-like fate (Fig. 2A). We have observed a similar 
pattern of mRNA expression in a total of three mde- 
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Fig 2 Expression of hSM-Calp in BAC-transfected BCsHl ceUs. 
Cells were stably transfected with BC-100899 and then analyzed for 
expression of the hSM-Calp transgene by RNase protection assay (A) 
or Western blotting (B). Note that both hSM-Calp mRNA and protein 
were present in growing (G) BCaHl cells (but not mock transfec- 
tants) only to decrease on differentiation (D). Total RNA or protein 
from human aorta and the human uterine smooth muscle cell (SMC) 
line (MyoLTR) serve as positive controls for hSM-Calp. Equivalent 
amounts of total RNA and protein were verified with 18 S and Grb2, 
respectively. 
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Fig. 3. Expression of deletion mutant BAC in stable BCsHl cells. 
Several independent cell lines (A7-9, and 14) carrying the same 
deletion of BO 100899 (5' boundary of deletion is at the novel Notl 
site depicted in Fig. IB) were selected in G418 medium and their 
total RNA isolated for RT-PCR analysis. Top: three of the four 
deletion mutant lines display a prominent signal in growing cells, 
which decreases when cells are induced to differentiate. In contrast, 
mock-transfected cells and line A14 did not express the hSM-Calp 
transgene (Southern blot analysis of line AH revealed a different 
restriction digestion pattern of the BAC clone, suggesting the clone 
was altered in some manner, data not shown). Bottom: giyceralde- 
hyde-3-pnosphate dehydrogenase control product, which is equiva- 
lent across all lanes. 

pendent BC3HI stable cell lines (data not shown). 
Importantly, stably transferred L6 myoblasts, which 
do not express endogenous SM-Calp (31), rarely (one 
line out of >10) exhibited positive expression for the 
hSM-Calp transgene (data not shown). Western blot- 
ting studies demonstrated an immunoreactive band 
corresponding to hSM-Calp protein that was also 
downregulated on differentiation of BC3HI cells (Fig. 
2B). Deletion of the full-length BAC to a Notl site (see 
Fig. LB) revealed appropriate expression regulation of 
hSM-Calp mRNA (Fig. 3), suggesting that the regula- 
tory elements for SM-Calp expression are contained 
within about 50 kb of genomic sequence. Collectively, 
these findings demonstrate the transcriptional and 
translational competence of the hSM-Calp transgene 
in BC3HI cells. Moreover, the hSM-Calp transgene 
appears to behave in the same manner as the endoge- 
nous mouse SM-Calp gene suggesting that the human 



locus contains the same BC 3 H1 differentiation-respon- 
sive regulatory elements as the endogenous mouse 
SM-Calp locus. 

hSM-CcUp is restricted to SMC-rich tissues in adult 
BAC transgenic mice. We generated two independent 
transgenic mouse lines (each with two copies of BC- 
100899 as assessed by Southern blotting, data not 
shown) to evaluate expression of the hSM-Calp trans- 
gene in adult tissues. As shown in Fig. 4, mRNA 
expression of hSM-Calp is essentially identical to that 
reported previously (Fig. 1A in Ref. 31) with bladder, 
intestine, stomach, and uterus showing the highest 
steady-state transcript leveL No hybridization signal 
was observed in nontransgenic bladder and uterus 
(Fig. 4). Levels of endogenous SM-Calp mRNA did not 
appear to be affected by the hSM-Calp locus, suggest- 
ing that there is no significant promoter competition 
between the endogenous mouse gene and the hSM- 
Calp transgene (data not shown). Consistent with a 
previous report in mice (17), human ECSIT, which is 
<10 kb away from the hSM-Calp locus, is widely ex- 
pressed in adult tissues (Fig. 5). Moreover, RT-PCR 
confirmed the mRNA expression of the hypothetical 
genes, LOC115950 and MGC4549 (data not shown). 

We then performed in situ hybridization to assess 
the spatial expression of hSM-Calp in adult tissues. 
This analysis revealed expression of hSM-Calp mRNA 
in medial SMC of the aorta, visceral SMC of the blad- 
der and stomach and bronchiolar SMC of the lung (Fig. 
6). Correct spatial expression of hSM-Calp was also 
observed in the intestine and uterus whereas the 
brain, heart, spleen, skeletal muscle, and liver showed 
only vascular SMC expression (data not shown). 

Western blotting studies revealed expression of 
hSM-Calp protein in such SMC-rich tissues as stomach 
and uterus (Fig. 7). We exploited the higher affinity of 
the antisera for human SM-Calp to determine whether 
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Pig. 4. RNase protection assay of hSM- 
Calp mRNA expression in adult BAC 
transgenic mouse tissues. A transgenic 
mouse line (T) was analyzed for ex- 
pression of the hSM-Calp transgene 
and compared with two nontransgenic 
(NT) tissues (bladder and uterus) us- 
ing a human-specific riboprobe. The 
tissue-restricted pattern and level of 
hSM-Calp expression was identical to 
that previously reported for the endog- 
enous mouse SMrCalp gene (Fig. 1A in 
Ref. 31). As expected, no hybridization 
signal was observed in nontransgenic 
bladder and uterus (NT). The 18 S sig- 
nal shows relatively equal total RNA 
loading across all lanes. Results were 
reproduced in a second transgenic line. 
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Fig 5 Human ECSTT (hECSIT) expression in transgenic mouse 
tissues. Total RNA from various non-SMC tissues was isolated for 
RT-PCR analysis with the use of primers that distinguish human 
from mouse ECSIT (mECSIT). Note the widespread expression of 
both hECSIT and mECSIT. 



the hSM-Calp protein was expressed in SMC of these 
tissues. As shown in Fig. 8, C and D, immunoreactive 
hSM-Calp was readily detected in both visceral and 
vascular SMC of the stomach in BAC transgenic mice. 
Under the conditions employed, little signal was ob- 
tained in nontransgenic littermates (Fig. 8, A and Bl 
Collectively, the results from adult BAC transgenic 
mice indicate that hSM-Calp is expressed in a manner 
analogous to the endogenous mouse gene at both the 
mRNA and protein levels. 

hSM-Calp expression mirrors endogenous SM-Calp 
during development- In situ hybridization studies in 
staged transgenic mouse embryos revealed a pattern of 
hSM-Calp expression that was very similar to the 
endogenous SM-Calp gene (31, 48). For example, ro- 
bust expression of the hSM-Calp transcript was noted 
in the heart at embryonic (e) 9.5 days with evidence of 



Pig. 6. In situ hybridixation analysis of hSM-Calp 
mRNA in adult BAC transgenic mouse tissues. Dark- 
field microscopy images of transgenic <A.C,E,G) vs. 
nontransgenic (B,DJJf) adult tissues processed for 
in situ hybridization using a human-specific ribo- 
probe. Note the restricted expression of hSM-Calp to 
vascular SMC of the aorta (A) and visceral SMC of 
bladder (O, bronchi ($) and stomach (G). Arrows 
indicate region of visceral SMC in nontransgenic 
bladder CD) and stomach (G and /J). F represents 
section of a nontransgenic lung, b, Bronchiole. 
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FV 7 Western blot analysis of hSM-Calp protein in adult BAG 
transgenic mouse tissues. Nontransgenic (N) and transgenic (T) 
stomach and uterus were processed for Western blotting and incu- 
bated with antieera either to SM-Calp or p-tubulin. The much h^her 
signal intensity of the transgenic vs. nc^transgenk tissues reflects 
uTaffinity of the antibody (raised to human SM-Calp) rather than 
absolute levels. Non-SMC tissues such as liver showed no bSM-Caip 
expression (data not shown). 

dorsal aortic SMC expression (Fig. 9, A-D). Intense 
heart and dorsal aortic expression continued through 
el2.5 days (Fig. 9, Widespread visceral SMC 

expression of the hSM-Calp transgene was observed in 
the bladder, gut, esophagus, and bronchi at el4.5-el5.5 
days (Fig. 9, J-P). Interestingly, expression of hSM- 
Calp mRNA in the heart occurs as late as el5.6 days, a 
time in which the expression of the endogenous mouse 
SM-Calp gene has all but vanished in this tissue (Fig. 
9 M-P) (31). Although we used a riboprobe to the 3' 
untranslated region of hSM-Calp (77% homology to 
mouse), some cross-hybridization to the endogenous 
mouse SM-Calp mRNA was observed in nontransgenic 
Httermates (Fig. 9, G and K). However, the transgenic 



embryos clearly exhibit a more intense hybridization 
signal at all gestational time points. Taken in aggre- 
gate, the results of these hSM-Calp transgenic studies 
show a near complete recapitulation of expression of 
the endogenous mouse gene. 

DISCUSSION 

A major innovation of the Human Genome Project 
has been the development of BACs, which harbor large, 
stable genomic fragments of DNA (50). BACs have 
been the workhorses in the sequencing of the human 
genome (12, 13, 53). In addition, BACs have been used 
to complement genetic mutations leading to gene iden- 
tification (2, 43) and to pinpoint distal regulatory ele- 
ments that confer cell-restricted gene expression in 
transgenic mice (37). The latter approach has been of 
particular interest inasmuch as many muscle-re- 
stricted transcription units are governed by cis-acting 
elements that may reside tens of kilobases away from 
the core promoter region (9, 47, 57). Denning such 
distal elements through small phage genomic clones 
can be time consuming and may have untoward effects 
on the expression of the reporter transgene (see below). 
In this report we used a BAC harboring the hSM-Calp 
transcription unit to show first that the expression of 
hSM-Calp exhibited similar regulatory expression in 
an in vitro model of cellular differentiation wherein 
expression of SM-Calp is attenuated on serum with- 
drawal and differentiation (33). The same BAC was 
then integrated into the mouse genome to assess hSM- 
Calp expression in an in vivo context where the trans- 




Fig. 8. Immunolocalization of hSM- 
Calp in vascular and visceral SMC of 
the stomach. Nontransgenic (A and B) 
and transgenic (C and D) stomach tis- 
sues were processed for immunohisto- 
chemistry with 3-amino-9-ethylcarba- 
zole as substrate (red color). Note that 
under the conditions used, the anti- 
body to SM-Calp preferentially detects 
the hSM-Calp protein in both vascular 
U) and visceral (arrow) SMC of the 
BAC transgenic stomach (compare B 
with D). Magnifications were X40 CA 
and O and xio4 (B and D). 
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Pig 9 In situ hybridization ofhSM-Calp mRNA in developmental^ staged mouse embryos. Saggital sections from 
e96 (A-D) el2.5 (JB-H), el4.6 tf-L>, or e!5.5 (W-P) day embryos of transgenic (A, 2*, P, /. /. and AO or 
nontransgenic (C, D t Q, H, K, L, O, and P) origin were hybridized to an antisense hSM-Calp nboprobe and 
processed for darkfield (A, C, E, G. /. K t M , and O) or brightfield IB. I>, P. H . J, L N t and P) microscopy. bSM-Calp 
is robustly expressed in embryonic heart (arrows) as early as e9.5 days CA and B) and persists through el5.5 days 
(Af and JV), a time at which the endogenous mouse SM-Calp gene is turned off in the heart (Q and P) and (31). 
High-level hSM-Calp expression is also observed in the dorsal aorta (arrowheads in A, B, E,F,l, and J), bladder 
and gut (/ «/, AT. and JV), and esophagus tf and J). For unclear reasons, cross-hybridization of the hSM-Calp 
nboprobe to endogenous mouse SM-Calp was evident in some of the nontransgenic embryos (heart in G and H as 
well as bladder and gut in K, L, O t and P). bt, Bladder, es f esophagus; g, gut See text for more details. 



gene would be subject to the complex signaling cues 
that normally converge on a transcription unit within 
highly ordered chromatin. Expression of the hSM-Calp 
transgene was restricted to both vascular and visceral 
SMC of adult transgenic mice, a result that contrasts 
with SM22 and CRP1 promoter mice, which only ex- 
pressed a lacZ reporter gene in arterial SMC of adult 
mice (15, 21, 23, 34). During development, the expres- 
sion of hSM-Calp mirrored that of the endogenous 
mouse gene with one notable exception; cardiac expres- 
sion of hSM-Calp appeared to persist longer than the 
endogenous mouse gene (Fig. 9) and (31). Consistent 
with endogenous data (31, 48), hSM-Calp is not ex- 



pressed in developing somites, a finding that contrasts 
with all other SMC markers known to be expressed in 
sarcomeric muscle. It will be of interest to determine 
the transcriptional mechanisms for such somitic silenc- 
ing as well as the protracted expression of hSM-Calp in 
developing cardiac muscle. 

How might one begin to define critical regulatory 
elements governing the complete expression profile of 
hSM-Calp from a >100-kb BAC? One approach would 
be to serially delete the BAC clone, a process that could 
identify distal elements within intergenic regions or 
even neighboring transcription units as appears to be 
the case with the skeletal muscle transcription factor 
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MRF4 (57). Here we made an aggressive deletion of the 
BAC and showed that proper hSM-Calp egression 
remained in the BC 3 H1 cell line. The BC3HI model 
system of SM-Calp gene regulation therefore repre- 
sents a fast and easy system to identify candidate 
deletion clones that would ultimately require testing in 
vivo. Another approach of locating discrete cis ele- 
ments in artificial chromosomes involves the random 
cloning and testing of BAC fragments in an enhancer 
trap assay. Studies to date using this approach, how- 
ever have failed to identify SMC-specific enhancers in 
the BAC containing hSM-Calp. An alternative ap- 
proach that is increasingly emerging aa a powerful tool 
to uncover important cis elements in the genome is 
through -phylogenetic fingerprinting,* in which com- 
parative sequence analysis is undertaken between two 
species (e.g., mouse vs. human) (64). We previously 
used such an analysis to show the existence of several 
evolutionarily conserved CArG boxes in the first nitron 
of SM-Calp, which appear to confer SMC-specific en- 
hancer activity in vitro (30). An important issue to 
investigate therefore will be whether these 
CArG boxes direct SMC-specific expression of the hSM- 
Calp transgene in vivo. Such point mutation experi- 
ments are possible using methods of homologous re- 
combination (35). . 

Although the use of BACs for purposes of defining 
regulatory elements may appear to be a somewhat 
daunting endeavor, we favor this approach over con- 
ventional lacZ studies for several reasons. First, regu- 
latory elements may reside considerable distances 
from the core promoter, making conventional genomic 
phage approaches both time and labor intensive. Ex- 
perience with such muscle-restricted promoters as 
SM22 (15, 21, 34), CRP1 (23), Csx/Nkx2.5 (52), 
MLC1F/3F (7), myf-5 (41), and desmin (18), to name a 
few, indicate the multiplicity of regulatory cassettes in 
and around gene locL Artificial chromosomes offer the 
opportunity to potentially capture all regulatory ele- 
ments of a gene, thus simplifying the analysis of gene 
regulation. This is of particular importance, given the 
fact that distal elements are likely to interact with one 
another and the core promoter to facilitate proper gene 
transcription. Second, incorporation of an artificial re- 
porter gene such as lacZ downstream of promoter se- 
quences alters the native sequence landscape of a gene 
and may thus disrupt important spatial or sequence 
configurations that must be preserved for proper gene 
transcription (10). Alternatively, epigenetic alterations 
(e.g., methylation) in the promoter sequences or re- 
porter itself could compromise reporter readout and 
hence confound interpretations of true promoter activ- 
ity (6, 38). In this context, we have recently observed 
transgenic mice carrying a 1.4-kb SM22 promoter 
fused to lacZ to express the reporter appropriately 
during development only to lose expression entirely 
during postnatal development (unpublished data, J. M. 
Mianb). These artifacts of reporter gene expression are 
greatly minimized when the reporter is the natural 
transcription unit itself in the context of a large 
genomic landscape. Ultimately, definitive proof of the 



importance of a regulatory element in governing 
proper gene expression will require gene-targeted ab- 
lation studies (39). 

Another major advantage of BACs is that they often 
contain more than one transcription unit, which pro- 
vides important intergenic sequence information. 
There is a growing consensus that intergenic se- 
quences harbor important regulatory elements that 
facilitate proper gene transcription. How, for example, 
can SM-Calp be restricted to SMC lineages when it is 
less than 10 kb away from the widely expressed ECSIT 
transcription unit? One way of attaining this impor- 
tant control is through intergenic boundary elements 
(e.g., insulators) that serve to shield a locus from the 
effects of an adjacent gene's promiscuous transcrip- 
tional apparatus (3). In silico analyses have failed to 
reveal the presence of a consensus enhancer-blocking 
boundary element (3) in the intergenic region between 
SM-Calp and ECSIT. It is possible, however, that other 
boundary elements exist in this region that will require 
more extensive computer and wet-lab analyses to un- 
cover. 

In addition to showing SMC-restricted hSM-Calp 
mRNA expression, we provide evidence for both vascu- 
lar and visceral SMC expression of the human SM- 
Calp protein. There are only 8 amino acid substitutions 
over the 297 amino acid primary sequences of mouse 
and human SM-Calp (6 are conservative). Thus it is 
reasonable to assume that the human SM-Calp protein 
will function in an identical manner as the endogenous 
mouse protein. These BAC transgenic mice therefore 
will be useful to breed to homozygosity and assess the 
physiological effects of SM-Calp over-expression in 
vivo. Moreover, the present BAC could be used to 
rescue the bone and vascular phenotypes observed in 
the SM-Calp null mice (29, 56). Interpreting such stud- 
ies, however, could be confounded by the presence of 
other human transgenes on the BAC (e.g., hECSIT). 

In summary, we have used a BAC harboring the 
hSM-Calp locus to show, for the first time, the complete 
recapitulation of the endogenous SM-Calp's promoter 
activity in developing and postnatal transgenic mouse 
tissues. Future studies will examine the importance of 
known (intronic CArG elements) and as yet to be de- 
fined regulatory elements in conferring SMC-restricted 
expression of hSM-Calp in the context of the present 
BAC. These and other studies will be instrumental in 
gaining a complete fingerprint of the transcriptional 
program underlying SMC differentiation in vivo and 
how this program of differentiation, is perturbed in 
disease states. 
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